FEAT: Implementing `is_integer` and `as_integer_ratio` for `QuadPrecision` #221

SwayamInSync · 2025-10-31T14:45:13Z

closes #216

I based the as_integer_ratio implementation on CPython's implementation.
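For readers unfamiliar with the CPython approach, here is a minimal Python sketch of the same idea applied to an ordinary `float` (double precision, not quad). The function names are hypothetical, and the actual change in this PR is written in C in scalar.c, which is not reproduced here.

```python
import math


def as_integer_ratio_sketch(x: float) -> tuple[int, int]:
    """Return (numerator, denominator) exactly equal to x, in lowest terms."""
    if math.isinf(x):
        raise OverflowError("cannot convert Infinity to integer ratio")
    if math.isnan(x):
        raise ValueError("cannot convert NaN to integer ratio")

    # Split x into mantissa * 2**exponent with 0.5 <= |mantissa| < 1 (or 0).
    mantissa, exponent = math.frexp(x)

    # Double the mantissa until it is an integer, adjusting the exponent
    # so that mantissa * 2**exponent stays equal to x.
    for _ in range(300):
        if mantissa == math.floor(mantissa):
            break
        mantissa *= 2.0
        exponent -= 1

    numerator = int(mantissa)
    denominator = 1
    if exponent > 0:
        numerator <<= exponent      # x is a (large) integer
    else:
        denominator <<= -exponent   # x has a fractional part
    return numerator, denominator


def is_integer_sketch(x: float) -> bool:
    """True if x is finite and has no fractional part."""
    return math.isfinite(x) and x == math.floor(x)
```

For example, `as_integer_ratio_sketch(0.75)` returns `(3, 4)`. A quad-precision version would follow the same steps with the corresponding 128-bit operations in place of `math.frexp` and `math.floor`.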

SwayamInSync · 2025-10-31T14:49:30Z

quaddtype/numpy_quaddtype/src/scalar.c

+    }
+}
+
+// this is thread-unsafe


@ngoldbaum precisely: Sleef_snprintf here is thread-unsafe. I believe the GIL alone won't help, since this is a C routine and the GIL only protects Python objects.
I'm not 100% sure, so I'd like your opinion on whether the GIL is enough or whether we should lock this region with pthread_mutex.

I guess that means it's not safe to call Sleef_snprintf concurrently from two threads (i.e. it's not re-entrant)?

In that case, yeah, I think you need a global lock.

I'd avoid using pthreads directly because then you'd need to do something else on Windows. Instead, I'd use PyMutex on Python 3.13 and newer and PyThread_type_lock on 3.12 and older. See e.g. the use of lapack_lite_lock in numpy/linalg/umath_linalg.cpp in NumPy, which solves a similar problem with the non-reentrant lapack_lite library.

You could also switch to C++ and use a C++ standard library lock, but then you need to be careful not to deadlock with the GIL: make sure you release it before making any possibly blocking calls. The built-in lock types have deadlock protection against the GIL, so you don't need to go through that trouble.

Also it'd be nice to add a multithreaded test for this as well. You can look at test_multithreading.py in NumPy for some patterns to use for multithreaded tests.
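As a rough illustration of the kind of multithreaded test suggested here, the sketch below lines threads up on a barrier so they hit the same call at the same moment, then checks that every thread saw the same result. It is not the test added in this PR: the helper name `run_threaded` and the use of a plain `float` as a stand-in for a `QuadPrecision` value are assumptions made to keep the example self-contained.

```python
import threading


def run_threaded(func, n_threads=8, iterations=200):
    """Call func() repeatedly from n_threads threads that start together."""
    barrier = threading.Barrier(n_threads)
    results = [None] * n_threads
    errors = []

    def worker(index):
        try:
            barrier.wait()  # release all threads at once to maximize overlap
            results[index] = [func() for _ in range(iterations)]
        except Exception as exc:  # surface failures in the main thread
            errors.append(exc)

    threads = [threading.Thread(target=worker, args=(i,)) for i in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    if errors:
        raise errors[0]
    return results


def test_as_integer_ratio_threaded():
    value = 0.1  # stand-in; a real test would use a QuadPrecision scalar
    expected = value.as_integer_ratio()
    for per_thread in run_threaded(value.as_integer_ratio):
        assert all(result == expected for result in per_thread)
```

A passing run of a test like this does not prove the absence of data races; it only makes them more likely to surface, which is why running under TSan is still useful.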

For now I've pushed only a multithreaded test to exercise this. I'll push the lock once the tests have run.

quaddtype/numpy_quaddtype/src/scalar.c

SwayamInSync · 2025-10-31T17:17:50Z

It doesn't seem to crash, somehow. @ngoldbaum, can you check test_multithreading.py here? If it looks good, I can drop the lock commit and we can merge.

ngoldbaum · 2025-10-31T18:00:03Z

Did you try running under TSan? You'll need to build python, numpy, SLEEF, and numpy-quaddtype all with the same compiler/sanitizer stack. See https://py-free-threading.github.io/thread_sanitizer/

SwayamInSync · 2025-10-31T18:08:49Z

Do we need to add this in CI?

ngoldbaum · 2025-10-31T18:16:36Z

I don't think so, a one-off local test every now and then is fine. It may make sense to add TSan testing along with supporting the free-threaded build and more extensive multithreaded testing, but not until a lot of work has happened towards that.

SwayamInSync · 2025-10-31T18:19:22Z

Cool, so I can push the commit that adds the lock.

SwayamInSync · 2025-11-01T00:00:57Z

> Did you try running under TSan? You'll need to build python, numpy, SLEEF, and numpy-quaddtype all with the same compiler/sanitizer stack. See py-free-threading.github.io/thread_sanitizer

I did this on macOS. Running pytest gave no TSan warnings for test_multithreading.py. For test_quaddtype.py there were 3, all related to BLAS ops, which I'll handle separately in QBLAS.

SwayamInSync · 2025-11-01T00:02:12Z

I'll push the code with the TSan build options commented out, so that anyone who wants to test with TSan in the future can just uncomment them.

SwayamInSync · 2025-11-01T00:09:53Z

@jorenham can this stubtest failure be fixed in your #218?

jorenham · 2025-11-01T01:03:24Z

> @jorenham can this stubtest failure be fixed in your #218?

Yes; stubtest does what it's supposed to do now 🎉

The errors can easily be fixed by copying these method stubs over from numpy:
https://github.com/numpy/numpy/blob/779929c42dd5dfa32f07f1311233abea9f026310/numpy/__init__.pyi#L4976-L4977

SwayamInSync · 2025-11-01T07:08:07Z

@ngoldbaum is this ready to merge?

SwayamInSync · 2025-11-01T12:58:30Z

@jorenham can you please take a look at this error in validating static types?

jorenham · 2025-11-01T14:34:56Z

> @jorenham can you please take a look at this error in validating static types?

The added methods need to be decorated with @override, imported from typing_extensions
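A minimal sketch of what decorating the added methods looks like. The `Floating` base class below is a hypothetical stand-in for the NumPy base type that the real stub subclasses, and the signatures are assumptions rather than copies of the actual stub. The fallback import is only there so the snippet runs without `typing_extensions` on Python 3.12 and newer.

```python
try:
    from typing import override  # available in Python 3.12+
except ImportError:
    from typing_extensions import override


class Floating:
    """Hypothetical stand-in for the base class that declares the methods."""

    def is_integer(self) -> bool: ...
    def as_integer_ratio(self) -> tuple[int, int]: ...


class QuadPrecision(Floating):
    # @override tells the type checker that these methods intentionally
    # replace a method of the same name on a base class.
    @override
    def is_integer(self) -> bool: ...

    @override
    def as_integer_ratio(self) -> tuple[int, int]: ...
```

With the decorator in place, a type checker reports an error if the base class ever stops defining a method of that name.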

juntyr · 2025-11-01T18:08:13Z

quaddtype/numpy_quaddtype/_quaddtype_main.pyi

@@ -1,5 +1,5 @@
 from typing import Any, Literal, TypeAlias, final, overload
-
+import builtins


Why the explicit import? We use bool without the builtins.bool elsewhere in the stubs

Copied from NumPy's stubs; I think both are the same.

In NumPy it's needed because bool is shadowed by np.bool. That shouldn't be a problem here, so there's no need for the builtins. prefix.
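The shadowing issue can be illustrated with a small runtime sketch. The local `bool` class below is a hypothetical stand-in for `np.bool`; in a namespace that defines its own `bool`, a bare `bool` annotation refers to that definition, and only `builtins.bool` still reaches the built-in type.

```python
import builtins


def resolve_annotations():
    """Show which type each annotation refers to when `bool` is shadowed."""

    class bool:  # stand-in for np.bool; shadows the builtin in this scope
        pass

    def shadowed() -> bool: ...           # refers to the local class above
    def explicit() -> builtins.bool: ...  # always the builtin bool

    return shadowed.__annotations__["return"], explicit.__annotations__["return"]


shadowed_type, explicit_type = resolve_annotations()
```

Here `explicit_type` is the built-in `bool`, while `shadowed_type` is the local class. A stub file that does not define its own `bool` has no such ambiguity, so the plain name is enough.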

ngoldbaum · 2025-11-01T20:13:04Z

I try to avoid looking at code on weekends - I'll look at this next week.

ngoldbaum

Sorry for taking a little while to look at this. I wanted to have time to run TSan testing locally.

For what it's worth, on the free-threaded build, I see a data race inside Sleef_iunordq1 on a global value in the new multithreaded test you added. Possibly this is shibatch/sleef#560. We should probably report it upstream as a bug.

quaddtype/reinstall.sh

quaddtype/numpy_quaddtype/src/scalar.c

quaddtype/subprojects/packagefiles/sleef/meson.build

SwayamInSync · 2025-11-04T20:19:15Z

That's a great catch. I was using the SLEEF from the subproject itself. Thanks a lot, @ngoldbaum.
I will resolve them and add a section to the README on building with TSan.

Regarding SLEEF (I didn't know PyTorch uses it): in QBLAS I am also using the vectorized functions, so I may need to evaluate this part extensively, given the related issues and this one.
Locking on every call could badly hurt performance.

We are on SLEEF v3.8 (the LTS is 3.9, but it focuses on DFT, not quad). In short, optimal fixes might take time, but the current issues are resolvable.

SwayamInSync · 2025-11-06T14:16:20Z

@ngoldbaum
Reading the TSan warning, it seems the issue is lazy CPU feature detection. The Sleef_iunordq1 is a wrapper that calls the best available function as per CPU whose pointer is stored in pnt_iunordq1 via some dispatching mechanism that triggers inside disp_iunordq1 and in order to keep it cached this optimal function pointer (pnt_iunordq1) is given global state.

So I think the workflow would be: both the T14 and T74 threads triggered Sleef_iunordq1 **for the first time simultaneously before the function pointer was properly initialized, which leads both dispatchers to race while trying to update this pointer.

I will report this to SLEEF, and I think till then in ours we have 2 options

lock (this will be the ultimate gurantee)
init all the ops during module initialization (this should let all the init of all global pointers) something like

PyMODINIT_FUNC
PyInit_quaddtype(void)
{
    PyObject *m;
    
    // ... other initialization ...
    
    // SLEEF init
    Sleef_quad dummy = sleef_q(1LL, 0ULL, 0);
    Sleef_quad dummy2 = sleef_q(2LL, 0ULL, 0);
    int dummy_int;
    
    // Warmup all SLEEF functions you use
    Sleef_iunordq1(dummy, dummy2);
    Sleef_icmpgeq1(dummy, dummy2);
    Sleef_fabsq1(dummy);
    Sleef_frexpq1(dummy, &dummy_int);
    Sleef_cast_from_doubleq1(1.0);    
    return m;
}

This might not guarantee the absence of other races if a function's implementation itself has issues.

SwayamInSync · 2025-11-06T17:30:24Z

Interesting, I am somehow still now able to see the race condition on x86-64 machine

~/temp/OSS/numpy-user-dtypes$ TSAN_OPTIONS="verbosity=1" pytest quaddtype/tests/test_multithreading.py -s 2>&1 | head -20
==4190773==Installed the sigaction for signal 11
==4190773==Installed the sigaction for signal 7
==4190773==Installed the sigaction for signal 8
***** Running under ThreadSanitizer v3 (pid 4190773) *****
============================= test session starts ==============================
platform linux -- Python 3.14.0+, pytest-8.4.2, pluggy-1.6.0
rootdir: /home/ssingh/temp/OSS/numpy-user-dtypes/quaddtype
configfile: pyproject.toml
collected 1 item

quaddtype/tests/test_multithreading.py .

============================== 1 passed in 11.00s ==============================
Stats: SizeClassAllocator64: 20M mapped (7M rss) in 1967503 allocations; remains 3666
  01 (    16): mapped:    256K allocs:    6784 frees:    6400 inuse:    384 num_freed_chunks   16000 avail:  16384 rss:      8K releases:      2 last released:    248K region: 0x720400000000
  02 (    32): mapped:    768K allocs:  830208 frees:  829952 inuse:    256 num_freed_chunks   24320 avail:  24576 rss:    592K releases:     10 last released:    180K region: 0x720800000000
  03 (    48): mapped:    256K allocs:    6528 frees:    6400 inuse:    128 num_freed_chunks    5333 avail:   5461 rss:     12K releases:      2 last released:    240K region: 0x720c00000000
  04 (    64): mapped:    256K allocs:     256 frees:       0 inuse:    256 num_freed_chunks    3840 avail:   4096 rss:     16K releases:      0 last released:      0K region: 0x721000000000
  05 (    80): mapped:    256K allocs:     128 frees:       0 inuse:    128 num_freed_chunks    3148 avail:   3276 rss:      8K releases:      0 last released:      0K region: 0x721400000000
  06 (    96): mapped:    256K allocs:     128 frees:       0 inuse:    128 num_freed_chunks    2602 avail:   2730 rss:      4K releases:      0 last released:      0K region: 0x721800000000

ngoldbaum · 2025-11-06T18:56:47Z

> I am somehow still now able to see the race condition on x86-64 machine

Do you mean you're not able to trigger it? Certain kinds of data races are only possible on ARM machines. x86_64 CPUs don't support weak memory ordering.

IMO you shouldn't do anything special inside quaddtype to handle this problem. I think the race might be benign in that both threads will end up writing the same result to the function pointer. Instead, I'd report it upstream, ideally with a reproducer written using e.g. pthreads so they can quickly reproduce the issue without installing quaddtype.

So I'd say to go ahead and merge this as-is without implementing your fix to initialize the pointers in module initialization.

SwayamInSync · 2025-11-06T19:29:36Z

Yeah, I just set it up on my Mac and ran it. Now I get a lot of races, all related to the dispatching of SLEEF functions, e.g.
pnt_floorq1, pnt_mulq1_u05, pnt_icmpeqq1, pnt_fabsq1, pnt_divq1_u05
Each of these is the global pointer that stores the routine dispatched for the machine.

SwayamInSync · 2025-11-06T19:31:08Z

> So I'd say to go ahead and merge this as-is without implementing your fix to initialize the pointers in module initialization.

Sure, I'll resolve the other comments, add the README section on TSan builds, and then merge it.

SwayamInSync · 2025-11-06T20:11:59Z

> Do you mean you're not able to trigger it? Certain kinds of data races are only possible on ARM machines. x86_64 CPUs don't support weak memory ordering.

Right. I also think that, since my x86-64 machine supports __float128, SLEEF falls back to libquadmath, so no dispatching is required in that case.

SwayamInSync · 2025-11-07T08:39:31Z

Cool. @ngoldbaum, once you've taken a look at the README, this is good to merge.

SwayamInSync added 3 commits October 30, 2025 12:13

num is losing prec

bbdd72c

adding is_integer

cb3243d

need to fix thread-safety

b8a60a6

SwayamInSync added the numpy_quaddtype label Oct 31, 2025

SwayamInSync commented Oct 31, 2025

adding multithreading test

9f95d21

SwayamInSync added 2 commits October 31, 2025 17:53

lock init on py < 3.13

72b2023

only lock call

652c35f

jorenham mentioned this pull request Oct 31, 2025

CI: fix inclusion path in typecheck workflow #222

Merged

debugging and TSan build info

9c54981

SwayamInSync mentioned this pull request Nov 1, 2025

TYP: QuadPrecision <: numpy.floating #218

Merged

SwayamInSync added 2 commits November 1, 2025 12:54

Merge branch 'main' into 216

798e503

adding stubs

9ffcbfd

decorating stub

476ac4d

juntyr reviewed Nov 1, 2025

SwayamInSync requested a review from ngoldbaum November 1, 2025 18:27

ngoldbaum reviewed Nov 4, 2025

SwayamInSync mentioned this pull request Nov 4, 2025

ENH: Support arbitrary-length Python ints in QuadPrecision constructor #213

Merged

docs

6856f5f

ngoldbaum merged commit 8487df3 into numpy:main Nov 7, 2025
11 checks passed

SwayamInSync deleted the 216 branch November 7, 2025 20:56
